Amendments to the Claims 



1 . (Previously Presented) In a scheduler having at least one target 
2 processor, a method for ordering instructions in a code file having a plurality of 

instructions, the method comprising: 
4 (a) determining dependencies between instructions in said plurality of 

instructions; 

6 (b) creating a directed acyclic graph showing said dependencies in said 

plurality of instructions, where said directed acyclic graph's nodes each correspond to an 
8 instruction from said plurality of instructions; 

(c) identifying one or more queues, including a first queue; 
10 (d) traversing said directed acyclic graph in a dependency-preserving manner; 

(e) creating a ready set of nodes comprising nodes in said directed acyclic 
12 graph having corresponding instructions in a ready state; 

(f) finishing if there are no nodes having corresponding instructions in a 
14 ready state; 

(g) identifying a threshold level in said first queue, said threshold 
16 corresponding to a maximum desirable fullness of said first queue; 

(h) if said first queue is less full than said threshold, choosing a node in said 
18 ready set that corresponds to an instruction that would increase the fullness of said queue, 

if any such node exists in said ready set; 
20 (i) if said first queue is at least as full as said threshold, choosing a node in 

said ready set that corresponds to an instruction that would decrease the fullness of said 
22 queue, if any such node exists in said ready set; 

(j) if no node is chosen in (h) or (i), heuristically choosing a node in said 
24 ready set; 

(k) removing said chosen node from said directed acyclic graph; 
26 (1) modifying said first queue in accordance with said chosen node and its 

corresponding instruction; 
28 (m) modifying the order of instructions in said code file in accordance with 

said chosen node and its corresponding instruction; and 
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(n) continuing processing at (d). 



2. (Previously Presented) The scheduler of claim 1, wherein: 
2 said first queue is configured to facilitate scheduling of the instructions; 

said node chosen in (h) corresponds to an instruction that will add an element to 
4 said first queue; and 

said node chosen in (i) corresponds to an instruction that will remove at least one 
6 element from said first queue. 

3. (Cancelled) 

4. (Previously Presented) The scheduler of claim 1 , wherein said (c) 
2 comprises identifying a load queue, a prefetch queue and a store queue, the method 

further comprising: 

4 (aa) determining and correlating a maximum desirable number of elements for 

each of said load queue, said prefetch queue, and said store queue; and 

6 (bb) determining a precedence ordering between said load queue, said prefetch 

queue, and said store queue. 

5. (Currently Amended) The scheduler of claim 1 , wherein: 
2 said (c) comprises: 

(cl) determining a number of queues to use in accordance with a target 

4 processor; 

(c2) determining a maximum desirable fullness for each of said 
6 determined number of queues; and 

(c3) determining a precedence ordering between each of said 
8 determined number of queues; and 

each of said (h) and said (i) comprises: 
10 (hil ) choosing one of said nodes in said ready set that will change the number 

of elements in one of said determined number of queues, in accordance with said 
12 precedence order and said maximum desirable fullness, if one of said nodes can be found; 
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and 

14 (hi2[[l]]) choosing one of said nodes in said ready set that will not change 

the number of elements in one of said determined number of queues, in accordance with 

16 said precedence order and said maximum desirable fullness, if none of said nodes in said 
ready set can be found that will change the number of elements in one of said determined 

1 8 number of queues. 

6. (Previously Presented) The scheduler of claim 1 , where said 

2 ordering of said instructions in said code file uses a hardware scheduler in accordance 
with said chosen node and its corresponding instruction. 

7. (Previously Presented) The scheduler of claim 1, wherein said (1) 
2 comprises: 

(aa) adding an element corresponding to said chosen node to said first queue if 
4 said first queue is less full than said threshold; and 

(bb) removing at least one element in accordance with said chosen node from 
6 said first queue if said first queue is at least as full as said threshold. 

8. (Previously Presented) A program storage device readable by a 
2 machine, tangibly embodying a program of instructions executable by the machine to 

perform a method for ordering instructions in a code file having a plurality of 
4 instructions, the method comprising: 

(a) determining dependencies between instructions in said plurality of 
6 instructions; 

(b) creating a directed acyclic graph showing said dependencies in said 

8 plurality of instructions, where said directed acyclic graph's nodes each correspond to an 

instruction from said plurality of instructions; 
10 (c) identifying one or more queues, including a first queue; 

(d) traversing said directed acyclic graph in a dependency-preserving manner; 
12 (e) creating a ready set of nodes comprising nodes in said directed acyclic 

graph having corresponding instructions in a ready state; 
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14 (f) finishing if there are no nodes having corresponding instructions in a 

ready state; 

16 (g) identifying a threshold level in said first queue, said threshold 

corresponding to a maximum desirable fullness of said first queue; 
1 8 (h) if said first queue is less full than said threshold, choosing a node in said 

ready set that corresponds to an instruction that would increase the fullness of said queue, 
20 if any such node exists in said ready set; 

(i) if said first queue is at least as full as said threshold, choosing a node in 
22 said ready set that corresponds to an instruction that would decrease the fullness of said 

queue, if any such node exists in said ready set; 
24 (j) if no node is chosen in (h) or (i), heuristically choosing a node in said 

ready set; 

26 (k) removing said chosen node from said directed acyclic graph; 

(1) modifying said first queue in accordance with said chosen node and its 
28 corresponding instruction; 

(m) modifying the order of instructions in said code file in accordance with 
30 said chosen node and its corresponding instruction; and 

(n) continuing processing at (d). 

9. (Previously Presented) The program storage device of claim 8, 
2 wherein: 

said first queue is configured to facilitate scheduling of the instructions; 
4 said node chosen in (h) corresponds to an instruction that will add an element to 

said first queue; and 

6 said node chosen in (i) corresponds to an instruction that will remove at least one 

element from said first queue. 

10. (Cancelled) 

1 1 . (Previously Presented) The program storage device of claim 8, 

2 wherein said (c) comprises identifying a load queue, a prefetch queue and a store queue, 
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the method further comprising: 
4 (aa) determining and correlating a maximum desirable number of elements for 

each of said load queue, said prefetch queue, and said store queue; and 
6 (bb) determining a precedence ordering between said load queue, said prefetch 

queue, and said store queue. 

12. (Currently Amended) The program storage device of claim 8, 
2 wherein: 

said (c) comprises: 

4 (c 1 ) determining a number of queues to use in accordance with a target 

processor; 

6 (c2) determining a maximum desirable fullness for each of said 

determined number of queues; and 
8 (c3) determining a precedence ordering between each of said 

determined number of queues; and 
10 each of said (h) and said (i) comprises: 

(hil) choosing one of said nodes in said ready set that will change the number 
12 of elements in one of said determined number of queues, in accordance with said 

precedence order and said maximum desirable fullness, if one of said nodes can be found; 
14 and 

(hi2[[l]]) choosing one of said nodes in said ready set that will not change 
16 the number of elements in one of said determined number of queues, in accordance with 

said precedence order and said maximum desirable fullness, if none of said nodes in said 
18 ready set can be found that will change the number of elements in one of said determined 

number of queues. 

1 3 . (Previously Presented) The program storage device of claim 8, 
2 where said ordering of said instructions in said code file uses a hardware scheduler in 

accordance with said chosen node and its corresponding instruction. 

14. (Previously Presented) The program storage device of claim 8, 
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2 wherein said (1) comprises: 

(aa) adding an element corresponding to said chosen node to said first queue if 
4 said first queue is less full than said threshold; and 

(bb) removing at least one element in accordance with said chosen node from 
6 said first queue if said first queue is at least as full as said threshold. 

15. (Currently Amended) A queue modeling instruction scheduler 

2 apparatus for use in compiling a program, the apparatus executable in a device having a 

processor operatively coupled to a memory, the apparatus comprising: 
4 (a) a directed acyclic graph creation module configured to: 

(al) determine dependencies between instructions in a program to be 
6 compiled; and 

(a2) create a directed acyclic graph showing said dependencies in said 
8 program, wherein said directed acyclic graph's nodes correspond to instructions 

in said program; 

10 (b) a directed acyclic graph traversal and ready set identification module 

configured to: 

12 (b 1 ) traverse said directed acyclic graph in a dependency-preserving 

manner; and 

1 4 (b2) create a ready set of nodes; 

(c) a ready set evaluation module configured to: 

16 (c 1 ) identify which nodes in said ready set correspond to which 

instructions in said program; and 
1 8 (c2) evaluate said instructions for their effect on memory operations; 

(d) a queue management module configured to: 

20 (dl) manage at least one queue, wherein managing a queue comprises 

adding and removing elements from said at least one queue, and wherein said 
22 elements correspond to nodes from said directed acyclic graph; and 

(e) a code scheduling module operably connected to said program, said 
24 directed acyclic graph traversal and ready set identification module, said ready set 

evaluation module, said queue management module and said directed acyclic graph, 
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26 wherein said code scheduling module is configured to: 

(el) add and remove nodes to and from said directed acyclic graph; 
28 (e2) determine and correlate a maximum desirable number of elements 

identifier for said at least one queue; 
30 (e3) choose one of said nodes in said ready set that will change the 

number of elements in said at least one queue in accordance with said correlated 
32 identifier; 

( e 4[[3]]) have elements of said at least one queue added and 
34 removed; and 

( e i[[4]]) change the order of instructions in said program in 
36 accordance with said instructions, said nodes, and said at least one queue. 

1 6. (Previously Presented) The apparatus of claim 15, wherein said at 
2 least one queue comprises a load queue, a prefetch queue and a store queue, and wherein 

said code scheduling module is further configured to: 
4 (e5) determine and correlate a maximum desirable number of elements 

identifier for each of said load queue, said prefetch queue, and said store queue; 
6 (e6) determine a precedence ordering between said load queue, said prefetch 

queue, and said store queue; and 
8 (e7) choose one of said nodes in said ready set that will change the number of 

elements in one of said load queue, said prefetch queue, or said store queue in accordance 
1 0 with said precedence order and one of said correlated identifiers. 

17. (Previously Presented) The apparatus of claim 15, wherein: 
2 said queue management module is further configured to: 

(d2) determine a number of queues to use in accordance with a target 
4 processor; and 

(d3) manage said determined number of queues; and 
6 said code scheduling module is further configured to: 

(e5) determine a precedence ordering between each of said determined number 
8 of queues; and 
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(e6) choose one of said nodes in said ready set that will affect the elements in 
10 one of said determined number of queues in accordance with said precedence 

order. 

1 8. (Previously Presented) The apparatus of claim 15, wherein said 
2 code scheduler module comprises a hardware scheduler. 

1 9. (Previously Presented) A method of scheduling a set of instructions 
2 during compilation of a program comprising the instructions, the method comprising: 

(a) constructing a directed acyclic graph depicting dependencies among 
4 instructions in the set of instructions, wherein each node in the graph corresponds to an 

instruction in the set of instructions; 
6 (b) identifying a ready set of nodes representing instructions having no 

dependencies on other instructions; 
8 (c) identifying a target level of fullness within a queue configured to facilitate 

reordering of the set of instructions; 
10 (d) if the queue is less full than said target level: 

(dl) selecting a node in said ready set that corresponds to an instruction 
12 that would generate a memory operation; and 

(d2) if said ready set includes no such node, heuristically selecting a 
14 node in said ready set; 

(e) if the queue is at least as full as said target level: 
16 (el ) selecting a node in said ready set that corresponds to an instruction 

that requires completion of a previous memory operation; and 
1 8 (e2) if said ready set includes no such node, heuristically selecting a 

node in said ready set; 
20 (f) removing the selected node from the ready set and the graph; and 

(g) scheduling the instruction corresponding to the selected node. 

20. (Previously Presented) The method of claim 1 9, wherein: 

2 said instruction that would generate a memory operation comprises one of: a load, 
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a prefetch and a store; and 
4 said instruction that requires completion of a previous memory operation is an 

instruction dependent upon one or more of: a load, a prefetch and a store. 
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